Compensation for goos-hanchen error in autofocus systems

ABSTRACT

Prediction of a distribution of light in an illumination pupil of an illumination system includes identifying component(s) of the illumination system the adjustment of which affects this distribution and simulating the distribution based on a point spread function defined in part by the identified components. The point spread function has functional relationship with configurable setting of the illumination settings.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation from U.S. patent application Ser. No. 14/302,187, now published as U.S. 2014/0293278, which is a continuation from U.S. patent application Ser. No. 12/884,890, now abandoned, which in turn claims the benefit of and priority from the U.S. Provisional Patent Application No. 61/244,321, filed Sep. 21, 2009, which provisional application is incorporated by reference herein.

BACKGROUND

The present invention provides a method for compensating errors due to the Goos-Hanchen effect in an autofocus (AF) system.

The Goos-Hanchen (GH) effect produces a shift of a beam when incident on an optical interface (e.g. a substrate that is imaged by an imaging optical system in the production of a semiconductor wafer). In one way of looking at this effect, any monochromatic beam incident on a reflecting surface can be decomposed into a sum of plane waves. The reflecting surface (e.g. the substrate surface) then produces a different phase for each plane wave depending on its angle of incidence. Very often, over a small range of angles, this phase on reflection will either increase or decrease with the angle of incidence producing a tilted wavefront in the far field, which is the same as a shifted spot at the reflecting surface—the near field. In an imaging optical system that includes a reflective surface near an image, this effect will produce a shift of the image. This is also true in an autofocus system that images some source object (e.g. a slit or fringes) onto the surface of investigation (e.g. a wafer) at a glancing angle of incidence and then relays that image to a detector. The position of the image on the detector will depend on the height of the surface of investigation, but will also depend on the variation of phase on reflection produced by that surface—the GH effect. In an AF system, this means that variations in the surface construction, which may consist of many thin film layers and printed circuit patterns, will produce an error in the surface height measurement; we call this the GH error.

The problem with the GH error is that it can vary with underlying substrate patterns, and coating thicknesses, and that variation can be large, e.g. several hundred nanometers to several microns. Moreover, that variation is typically indistinguishable from the substrate (substrate) topography in an optically based AF system.

One approach to compensating the GH-effect is to use ellipsometry to determine the substrate film structure, and then use the film structure to estimate the GH error, and finally subtract that error from the measured surface height. However, ellipsometry requires a complex optical system of its own, a big increase in computational power, and a lot of input from the user.

SUMMARY

The present invention provides a method for compensating errors caused by the Goos-Hanchen effect in an optical autofocus system that uses the position of an image reflected from a substrate (e.g. wafer surface) to determine changes in the z position of the substrate. According to the invention, reflected light from the substrate is provided at a plurality of wavelengths and polarizations, detected and used to make corrections that compensate for the errors due to the Goos-Hanchen effect.

One way of compensating GH errors, according to the principles of the present invention, is referred to as the “analog” approach. In this approach, a broad band light spectrum is directed at the substrate, so that the variation of GH error is minimized across various substrate patterns. This approach is already deployed, in a non-optimized way, in very broadband AF systems, where the influence of spectrally isolated GH errors are reduced by the more ubiquitous spectral components that have smaller GH errors. However, according to the present invention, the broad band illuminating spectrum is filtered, e.g. with a dynamic filter, or with a custom interference filter before reaching the detector so that the average GH error (averaged across wavelength and polarization by the detector) is minimized. This key idea behind this approach is that the GH error is an average over the spectrum and polarizations and that no further specialized data processing is necessarily used in correcting the GH error. We therefore call it the “analog approach”.

Another embodiment comprises a slight modification to the analog approach. In this case the position of the imaged source object, each wavelength, or wavelength band, and/or polarization receives a shift that biases the measured substrate position for that wavelength, wavelength band and/or polarization so that the average position is further compensated for GH errors. Such a bias could be achieved in a fringe projection system using a modified spectral filter that, in addition to attenuating the light as a function of wavelength, also applies a dynamic and differential phase shift (between +1 and −1 orders), which will shift the image of the source object on the detector for the wavelength, wavelength band and/or polarization concerned.

Another way of compensating GH errors, according to the principles of the present invention, is referred to as the “digital” or “digital filter” approach. With this approach, rather than modifying the spectrum in the incident and reflected light, a spectral and polarization filter is applied in software after each wavelength, wavelength band and/or polarization is detected separately in space, time or angle in accordance with the principles of the present invention. A single broadband spectrum, or many narrow band spectra, or a combination of broad and narrow band spectra are used to illuminate the substrate. The combined spectrum is then separated into several sub-bands and polarizations that are directed at one or more detectors that sense the position of the substrate and possibly its reflectance as a function of wavelength, polarization. Then the AF position is estimated with a weighted average among the spectral and polarization components, where the weighting (known as a digital filter) is made to reduce the overall variation in GH error across various substrate conditions for a given process (and/or a given imaging optical system). This type of system and method can be used with a fixed optical system with few or no moving parts.

In one version of the digital approach, broadband unpolarized illumination is used in imaging the object (e.g. slits or fringes) to the substrate. The light leaving the substrate is then separated into different polarizations and wavelengths that are then detected separately. The key point here is that the separation is done after reflection from the substrate. In this version of the method, a combination of dichroic and polarization beam splitters can be used to separate the measurements. In another embodiment, gratings can be used to perform the chromatic separation. In another preferred embodiment, the chromatic separation can be performed by a pair of prisms, a first prism that spreads the reflected light as collimated light in angles by wavelengths, and a second prism that is displaced from the first prism along the z axis, and makes the collimated rays at all the wavelengths parallel, so that the wavelengths are spatially separated, but their directions maintained. In either case of gratings, or prisms used to perform the chromatic separation, polarization beam splitters can be used to perform the polarization separation. In another embodiment, the polarization separation can be performed with polarizing elements placed directly in front of the detector elements that receive duplicate images of the of the source object.

In another version of the digital approach, the source object is illuminated by light that contains a plurality of wavelength bands such that each wavelength band is well separated in the far field image of the object. In this case, the different wavelengths can be picked off in the pupil of the relay optics following reflection by the substrate. With the wavelength well separated in the pupil, a set of mirrors or prisms can be used to direct each wavelength band to different detectors. In a preferred embodiment, a set of tilted mirrors is used to translate the image to different areas of a CCD. In this approach, polarization can be separated in the same way, or as is preferred, by a polarizing beams splitter that sends the beam to two separate CCDs.

In yet another version of the digital approach, the source object is illuminated sequentially in time by a plurality wavelength bands and polarizations. In this embodiment, the measurements at each polarization and wavelength band are also made sequentially in time.

In all of the disclosed versions of the invention, it is preferred that source object, that being imaged onto the substrate, and relayed to a detector, comprise a set of sinusoidal fringes produced by two-beam interference. Such fringes can be generated by illuminating a linear grating having twice the desired periodicity, and filtering the far field image such that only the +1 and −1 orders are allowed to reach the detector.

The present invention takes advantage of the fact that the GH error is significantly different across wavelengths and polarizations. Because of this, different spectra have different amounts of GH error for different substrate structures. And measurements made at a plurality of wavelengths and polarizations similarly contain information about those substrate structures.

Thus, the present invention compensates GH errors with or without detailed information about the substrate, without the complexities of ellipsometry (it can be thought of as an approximation or short-cut to ellipsometry), in a manner such that the GH error can be reduced to almost arbitrarily low levels.

Further aspects of the present invention will become apparent from the following detailed description and the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic illustration of a fringe type projection system, with which the present invention is particularly useful;

FIGS. 2a and 2b are schematic illustrations of one way of compensating the GH effect, in accordance with the principles of the present invention;

FIGS. 3a and 3b are schematic illustrations of additional aspects of compensating GH errors in an autofocus system, in accordance with the principles of the present invention; and

FIG. 4 is an illustration of a pair of prisms that can be used on the detection side of a system in practicing the principles of the present invention.

DETAILED DESCRIPTION

As described above, the present invention provides a new and useful concept for compensating errors due to the Goos-Hanchen (GH) effect in an autofocus (AF) system. The principles of the present invention are particularly useful in compensating GH errors in a fringe type AF system, and are described herein in connection with such a system. However, from that description, the manner in which the principles of the present invention can be applied to other AF systems (e.g. systems that use slit type detection) will be apparent to those in the art.

FIG. 1 schematically illustrates the principles of an autofocus (AF) system and method, using fringe projection. The system has a sending side 100, from which light is directed at a substrate 102, and a receiving side 104, wherein light reflected from the substrate is directed to a detector 106. On the sending side 100, the light (e.g. broadband or “white’ light) is reflected from a fringe generator 108, filtered by a filter 110 and reflected from the substrate 102. The fringe(s) of the reflected light is (are) detected at the detector 106, and used to determine the initial position of the substrate surface in the z direction. Subsequent operation of the AF system is then used to determine changes in the z position, and those changes may be used to control movement of the stage that supports the substrate in the z direction. The “fringe shift”, i.e. the amount by which the fringe is shifted, may be defined by y=2 m²z Tan θ, where z is the z position of the substrate, θ is the angle of incidence (shown in FIG. 1), m is the magnification between the substrate and detector, and y is the fringe position along the detector (shown in FIG. 1).

There are two (2) basic approaches to the method by which Goos-Hanchen errors may be compensated, according to the principles of the present invention; one is referred to herein as the “analog” approach, and the other is referred to herein as the “digital” or “digital filter” approach.

The analog approach is an extension of the idea that a different spectra produce different amounts of Goos-Hanchen (GH) error for a given surface, and that there is some spectrum that will minimize this error. Therefore to compensate the GH error, the method of the invention provides for adjusting the input spectrum, either by filtering the spectrum (and polarization) of a broad band source, or by varying the amount of light that is allowed to pass from a set of relatively narrow band sources. A related approach, which can still be referred to as an analog approach, is to introduce a wavelength (and polarization) dependent shift to fringes in fringe projection systems (or to the slit images in slit projection systems). This allows the implementation of negative spectral components. In practice a combination of these approaches may be beneficial. In these versions of the analog approach, the average autofocus signal (averaged over all wavelengths) will contain reduced GH error, and has the convenience of using few detector elements relative to the digital method.

In the digital approach, the optical AF signal is divided into spectral (and polarization) components at the detector. This can be done in time by pulsing the sources and alternating/rotating the polarization state (this could also be done to the light after it is incident on the substrate with appropriate chopping and/or switching mechanisms), or by separating the image spatially and sending the different wavelengths (and polarizations) to different detector elements—producing a plurality of AF measurements for a single position on the substrate. Once the plurality optical AF measurements have been made for a single position, they are combined by a weighted sum, where the weightings have been chosen to reduce the dependence on the GH error, much in the same way the spectrum was chosen in the analog approach.

In choosing the weights or the spectra; in order to effectively reduce the GH error, a set of weights or spectra shapes that achieve this goal must be found, and there are several possible approaches. For example, one way to determine the GH error as a function of the spectra or weights is by simulation, and then determining the best spectra or set of weights by some sort of optimization—e.g. simulated annealing, or damped least squares. Another approach, is to make a set of chromatic and polarization separated optical AF measurements on the target surface, and then also measure the surface by some other method that does not have GH errors (like an air-gauge, or touch profilometer) and then find the set of weights that reduces the GH error. This approach is easily amenable to a least squares solution and one skilled in the art will easily see that.

Another way to characterize the digital approach is that each wavelength (and polarization) is used to measure the z position of the substrate. This gives a set of measured z positions. Then, the weighted average (set of a and b coefficients) are applied to the set of z positions to give a single, final, corrected measurement of the substrate z position. In order to know the coefficients a, b, the AF system must be calibrated for a given substrate type. This can be done in at least 2 ways:

-   a) Use a physical sensor (air gauge, etc) to measure the height at     some set of positions on the substrate and compare this with the     results from the optical autofocus system. The air gauge or other     physical sensor can't be used for every substrate because it is much     slower and has a hard time measuring many points. This is done once     per process, establishing the set of {a, b} -   b) Print a test substrate for a given process and use the result of     the printing (which used the data from the optical AF to establish     the substrate at the right z height during printing) to determine     the error of the optical AF, and use this to determine the set of     {a, b}.

In the digital approach, it is possible to use other information (such as the amount of reflected light, for example) to sub-classify different regions on the substrate. Say half the substrate has copper (process A) and the other half has no copper (process B). There could be 2 sets of coefficients {a, b}, that could be preferentially applied depending on which region is being measured.

Also, in the digital approach, the light reflected by the substrate is used to determine changes in the z position of the substrate by applying a weighting average to the set of z measurements at the different wavelengths (λ's) and polarizations (e.g. s and p polarizations), and using the weighting average to make corrections that account for the Goos-Hanchen effect. Moreover, the weighting average is produced by a linear least squares regression estimate of the coefficients of the first order position of the substrate, preferably according to the following formula

$Z_{j} = {a_{0} + {\sum\limits_{n = 1}^{N}{\sum\limits_{{v = s},p}{a_{k,v}z_{j,n,v}}}} + {\sum\limits_{m = 1}^{M}{\sum\limits_{{v = s},p}{b_{k,v}r_{j,m,v}}}}}$

where

-   Z_(j) is the substrate height at position j on the substrate -   a_(o) is a DC offset value -   a_(k) is the set of coefficients for the z_(j)(lambda) measurements,     where there is a different j for each wavelength

$\sum\limits_{{v = s},p}$

This is a sum over the s and p polarization measurements

-   M is the number of wavelength bands -   r_(j,m,v) is the reflectance at     -   j^(th) position     -   m^(th) wavelength band (or sub-spectrum)     -   v^(th) polarization -   z_(j,n,v) is the z height at     -   j^(th) position     -   n^(th) wavelength band     -   v^(th) polarization -   b_(k) is the set of coefficients for the reflectance measurements of     the substrate, r_(j)(lambda). The same data collected and used to     measure z will also be used to calculate wavelength dependent values     for r, the reflectance. The set of b's are the weighting     coefficients for making a correction using this data. -   For a and b, the v subscript is for the two polarization states (s     and p).

By comparison, the analog approach would use the same type of calibration for determining the set of {a, b}, but it would be really hard to have spatially varying sets of coefficients (say, 2 sets for process A and B). The implementation of the filter would be done by attenuating the light at the various wavelengths and polarizations using a mechanical shutter, or filter. Then, the detector would measure z only once, and that measured z would be the result of applying the {a,b} coefficients directly on the light, rather than on the digital, calculated versions of z. The potential advantage of the analog method is that the data collected could be much less and the corrected z measurement could be determined very quickly.

FIGS. 2a and 2b schematically illustrate one version of providing an AF system and method that compensates errors due to the GH effect, which is particularly useful with the digital approach described herein. In the system and method of FIGS. 2a and 2 b, the light is directed at the substrate 102 at a plurality of wavelengths (λ's), the light at the plurality of wavelengths is reflected from the substrate (and from a reference mirror) and broken into different polarizations, and then detected. In this version of the method, separate light sources 107 of finite spectral width are incident on the substrate 102 (and on a reference mirror 115) at different angles such that they can be separated in angle space. The light from the sources is reflected from a fringe generating grating forming part of a fringe generating module 108 (and also from a reference region of the fringe generating module), and directed through a filter 110 before it is reflected from the substrate 102 and the reference mirror 115. The reflected light at the different wavelengths is spatially separated by prisms 109, and then separated by polarization by means of a polarization beam splitter PBS 105. Thus, reflected light at the different wavelengths and polarizations is directed to respective detectors 106 a, 106 b.

In another version of an autofocus system and method, according to the principles of the present invention, which is also particularly useful with the digital approach described herein, and shown schematically in FIGS. 3 a, 3 b, and 4, broadband light (or “white”) light from a broadband source is directed at a substrate 102 (and also at a reference mirror 115), (through a fringe generating module 108 and a filter 110, that are similar to the prior embodiment and therefore not shown). Broadband light reflected light from the substrate 102 and the reference mirror 115 is then magnified (e.g. by a magnification relay 111), broken into different polarizations and wavelengths that are then detected. In this version of the method, it is preferred that the broadband light reflected from the substrate (and the reference mirror) is refracted by a pair of prisms 117 a, 117 b, a first prism 117 a that spreads the reflected light as collimated light in angles by wavelengths, and a second prism 117 b that is displaced from the first prism along the z axis, and makes the collimated rays at all the wavelengths parallel, so that the wavelengths are spatially separated. The light is further separated by polarization, by polarization beam splitters similar to 105 (FIG. 2a ), and detected by detectors 106 a, 106 b.

In yet another version of an autofocus system and method, according to the principles of the present invention, which is also particularly useful with the digital approach described herein, the system would be set up in a manner similar to that shown in FIGS. 2 a, 2 b, but light that is directed at the substrate 102 comprises light from an illumination source that produces light sequentially at a plurality of wavelengths and a plurality of polarizations, and light at the plurality wavelengths and polarizations is detected sequentially. In this version, there would not be a need for the plurality of prisms 109 shown in FIGS. 2 a, 2 b.

In all of the disclosed versions of the invention, it is preferred that source object, that being imaged onto the substrate, and relayed to a detector, comprise a set of sinusoidal fringes produced by two-beam interference. Such fringes can be generated by illuminating a linear grating having twice the desired periodicity, and filtering the far field image such that only the +1 and −1 orders are allowed to reach the detector.

Thus, the basic concept of the present invention corrects for GH error in an AF system and method, by detecting light reflected from the substrate at different wavelengths and different polarizations, and using the detected light to compensate GH error. The principles of the present invention can be applied to either slit detection or fringe detection, but fringe detection is currently preferred. However, from this description, the manner in which the principles of the invention can be practiced with slit detection will be apparent to those in the art.

Also, in applying the principles of the present invention to an AF system and method, it should be noted that light directed at the substrate is preferably in a spectral range of 400 nm to 1000 nm so that it can work with the most commonly available glasses and detectors, but in principle can be any range of wavelengths that do not damage or alter the surface under investigation. The manner in which the light is directed at the substrate at a plurality of wavelengths and polarizations can take a number of forms, such as a single broadband light source that is filtered to produce the different wavelengths and different polarizations, or a plurality of light sources that produce light in narrow bands, and at different polarizations. The light at the plurality of wavelengths and polarizations is directed at the substrate (and the reference mirror), and reflected light from the substrate (and the reference mirror), at the plurality of wavelengths and polarizations, is directed to one or more detectors. The detected light is used to determine changes in the z position of the substrate being imaged. Preferably, the invention contemplates separate detection for each wavelength and each polarization. Moreover, the detector(s) can also measure the reflectance of the substrate, which is useful with the digital filter aspect of this invention, described further below.

The “analog” approach uses a broadband light source, and filters (e.g. dynamic filters or custom filters), or a number of multiplexed broadband or narrowband sources, to modify the spectrum (both wavelength and polarization) so that the variation of GH error is minimized across various substrate patterns. Thus, a broad band illuminating spectrum is filtered with a dynamic filter, or with a custom interference filter, to produce light at different wavelengths and polarizations in the light directed at the substrate and/or the light reflected from the substrate, and each of those different wavelengths and polarizations is detected simultaneously. Each sub-band and polarization is biased by the measurement system, so that, for example, individual wavelength bands and polarizations contain more or less power relative to the others and may also overestimate or underestimate the substrate position so that the final measurement has reduced sensitivity to GH shifts.

With the “digital filter” approach, a spectrum filter is applied in software. On the “sending” side of the substrate (i.e. from which light is directed at the substrate), the light is produced by a broadband source (that is filtered, e.g. by a turret filter) or several discrete light band sources, to direct light at the substrate at different wavelengths and at the different polarization (i.e. s and p polarizations) and possibly different angles of incidence. On the “receiving” side of the substrate (i.e. which receives the reflected light from the substrate) reflected light at the different wavelengths and polarizations is directed at one or more detectors, and the light at the different wavelengths and polarizations is separately measured at the detectors. The information from the detectors can then be used to determine variations of the z position of the substrate. The detector(s) can take various forms, e.g. CCD (charged couple device), individual slit detectors, etc., which would collect data so that the phase of the projected fringes can be calculated and the substrate height can be measured. Prisms or gratings are used to separate the fringes into finite wavelength bands, e.g. corresponding to pixel columns on the CCD. This allows for the use of a digital filter approach, as described herein.

The digital filter approach can be practiced with a serial or parallel approach, in terms of the way light is handled at the sending and receiving sides of the substrate. The “serial” approach provides a single detector that is sensitive to multiple wavelengths and polarizations on the receiving side, and a sending side that is switched between wavelengths and polarizations sequentially in time. The serial approach allows for switching between sources at discrete bands, or providing filters with broadband sources (e.g. turret type filters), or by switching between different sources having different spectra. If filters are used they can be on the sending and/or the receiving side of the substrate. With the “parallel approach”, all wavelengths and polarizations are projected simultaneously at the substrate; the wavelengths and/or polarizations may be split at the receiving side of the substrate and directed to multiple detectors. Then the AF position is estimated with a weighted average among the spectral and polarization components, where the weighting (analogous to a digital filter) is made to reduce the overall variation in GH error across various substrate conditions for a given process.

As specifically illustrated in FIGS. 3 a, 3 b and 4, where a broadband source is reflected from the substrate, in order to separate the wavelengths, a pair of prisms is used (117 a, 117 b). The first prism (117 a) spreads the collimated light out in angle by wavelength. The second prism (117 b) makes rays from all wavelengths parallel again, but since the second prism is displaced along the z-axis from the first prism, the colors have been spatially separated. This light is then incident on the detector(s).

With a system and method using the principles of the present invention, because the GH error is strongly dependent on wavelength, a small change in the spectrum of a broad-band source can have a noticeable effect on the error. Thus, the principles of the present invention contemplate using a spectrum where the GH error is minimized, modifying the illuminating spectrum with some sort of dynamic filter, measuring the slits or fringes at several wavelength bands and apply the spectrum in software as a digital filter.

The digital filter approach is particularly attractive (and is therefore currently preferred) because phase and reflectivity information can be generated, as well as irradiance (light intensity), all of which are related to changes in the z-position of the substrate and can easily be incorporated into a digital filter.

A system and method that practices the principles of the present invention can be thought of as a short-cut to ellipsometry where, instead of a priori knowledge, measurements from a sensor that is relatively immune to the GH effect can be used to determine the filter that best matches the measured z-position. Moreover, it may be possible to use the optical sensor data from several substrates to find a filter that minimizes the variation, possibly allowing for low order deformations of the substrate surface. In either case, the method could be largely automated.

With the digital approach, the method of finding the spectral filter is currently a linear least squares estimate of the coefficients of the first order position of the substrate as function of wavelength band and polarization, but it is recognized that there will be other approaches that can be used, and that higher order regression models involving other measureable quantities like reflectance can be included in the filter.

In one preferred embodiment of the digital filter approach, the optimal filter is calculated from a set of measured substrate heights z_(j)(□_(kv)) where j (from 1 to N) indicates the location of the measurement on the substrate, □_(kv) indicates the wavelength band k (from 1 to M) and polarization state v (e.g., either s, or p). At each of these locations, there is also a known substrate height Z_(j), which may be measured by an independent system that is relatively immune to the GH effects. Given this notation, we desire an estimate of the known height Z_(j) in terms of the spectrally measured heights z_(j)(□_(kv)). The simplest solution form is a linear combination of the measured heights,

$Z_{j} = {a_{0} + {\sum\limits_{k = 1}^{M}{\sum\limits_{{v = s},p}{a_{kv}{{z_{j}\left( \lambda_{kv} \right)}.}}}}}$

One approach for determining the coefficients is linear least squares regression using the following regression model (which is described in paragraph 0028).

$Z_{j} = {a_{0} + {\sum\limits_{n = 1}^{N}{\sum\limits_{{v = s},p}{a_{k,v}z_{j,n,v}}}} + {\sum\limits_{m = 1}^{M}{\sum\limits_{{v = s},p}{b_{k,v}{r_{j,m,v}.}}}}}$

where

-   Z_(j) is the substrate height at position j on the substrate -   a_(o) is a DC offset value -   a_(k) is the set of coefficients for the z_(j)(lambda) measurements,     where there is a different j for each wavelength

$\sum\limits_{{v = s},p}$

This is a sum over the s and p polarization measurements

-   M is the number of wavelength bands -   r_(j,m,v) is the reflectance at     -   j^(th) position     -   m^(th) wavelength band (or sub-spectrum)     -   v^(th) polarization -   z_(j,n,v) is the z height at     -   j^(th) position     -   n^(th) wavelength band     -   v^(th) polarization -   b_(k) is the set of coefficients for the reflectance measurements of     the substrate, r_(j)(lambda). The same data collected and used to     measure z will also be used to calculate wavelength dependent values     for r, the reflectance. The set of b′s are the weighting     coefficients for making a correction using this data. -   For a and b, the v subscript is for the two polarization states (s     and p).

In an extension of this approach, nonlinear combinations of the various measurements may be included in the model. For example, the square of each z-measurement z_(j)(□_(kv)), or cross terms like z_(j)(□_(kv)) z_(j)(□_(h)). Similarly, cross terms between reflectance measurements and cross terms between z and r can be used to make the correction, to any order deemed useful by the usual techniques of regression analysis or by careful analysis of the ellipsometric relationships.

A further extension of this approach could include measurements of substrate height and reflectance at multiple angles of incidence and/or substrate orientation (i.e. clocking).

Also, one could use a nonlinear regression, for example with polynomial basis as follows:

$Z_{j} = {a_{0} + {\sum\limits_{n = 1}^{N}{\sum\limits_{q = 1}^{Q}{\sum\limits_{{v = s},p}{a_{k,v}z_{j,n,v}^{q}}}}} + {\sum\limits_{m = 1}^{M}{\sum\limits_{v = 1}^{T}{\sum\limits_{{v = s},p}^{\;}{b_{k,v}r_{j,m,v}^{t}}}}}}$

It is also preferred that the projected pattern (either slits or fringes) at all wavelengths have the same period and phase, and that all of the wavelengths are looked at simultaneously, or separated with a prism/grating such that each band corresponds to one region of an area detector (e.g., a charge coupled device (CCD)). Those principles would utilize, e.g., a mirror array placed conjugate to the substrate.

Thus, the foregoing description provides several new and useful ways of compensating for errors due to the Goos-Hanchen effect in an optical autofocus system and method that uses light reflected from a substrate to determine changes in the z position of the substrate. The correction is performed through the use of reflected light from the substrate at a plurality of wavelengths and polarizations that is detected and used to make corrections to the z position of the substrate that compensate for the errors due to the Goos-Hanchen effect. From the foregoing description, other ways of compensating for errors due to the Goos-Hanchen effect in an autofocus system and method, using the principles of the present invention, will become apparent to those in the art. 

1. A surface position detecting apparatus comprising: a projection system configured to project light onto a surface of a chosen workpiece to form a pattern of light distribution thereon, the pattern including a first area with a first level of illumination and a second area with a second level of illumination, the first level being lower than the second level; a light detecting system including a polarization filter unit configured to receive light said light in reflection from the surface of the workpiece and to separate said light into a first light portion having a first state of polarization and a second light portion having a second state of polarization, the first and second states of polarization being different from one another; a first optical detector unit disposed (i) to acquire the first light component, (ii) to detect, in said first light component, a first distribution of light corresponding to said pattern, and (iii) to generate a first output signal in response to having detected said first distribution of light; a second optical detector unit disposed (a) to acquire the second light component (b) to detect, in said second light component, a second distribution of light corresponding to said pattern, and (c) to generate a third output signal in response to having detected said second distribution of light; and a controller unit operably cooperated with the light detecting systems and configured to calculate a position of the surface of the workpiece based on the first and second output signals.
 2. The surface position detecting apparatus according to claim 1, configured to cause said first light portion to contain a first amount of Goos-Hanchen shift originated as a result of interaction of said light with the workpiece, and to cause said second light portion to contain a second amount of Goos-Hanchen shift originated as a result of interaction of said light with the workpiece, wherein the first and second amounts of Goos-Hanchen shifts are different from one another.
 3. The surface position detecting apparatus according to claim 2, wherein the controller unit is configured to reduce an error, contributed to said position of the surface of the workpiece as a result of Goos-Hanchen shift experienced as the workpiece by said light, with the use of the first and second output signals.
 4. The surface position detecting apparatus according to claim 1, wherein the light detecting system includes a first condensing optical system disposed to condense light received by it from the surface of the workpiece; a second condensing optical system disposed between the polarization filter unit and the first optical detector unit to make the surface of the workpiece and a detecting surface of the first optical detector unit be optically-conjugate to one another and to condense the first light portion; a third condensing optical system disposed between the polarization filter unit and the second optical detector unit to make the surface of the workpiece and a detecting surface of the second optical detector unit be optically-conjugate to one another and to condense the second light portion.
 5. The surface position detecting apparatus according to claim 4, wherein the projection system includes a light source configured to emit said light, said light being polychromatic.
 6. The surface position detecting apparatus according to claim 5, wherein the light detecting system includes a spectroscope arranged between the first condensing optical system and the polarization filter unit to spatially split said light received from the surface of the workpiece according to wavelengths.
 7. The surface position detecting apparatus according to claim 1, further including a light source configured to emit said light, and wherein the projection system includes a diffraction grating disposed to receive said light and diffract to form diffracted light; and a light sending optical system between the diffraction grating to condenses the diffracted light on the surface of the workpiece to form said pattern thereon.
 8. The surface position detecting apparatus according to claim 7, wherein the projection system further include a spatial filter disposed to transmit positive and negative diffraction orders of said diffracted light and to block the 0th diffraction order thereof.
 9. The surface position detecting apparatus according to claim 8, wherein the spatial filter is configured to transmit a +1 diffraction order and a −1 diffraction order of said diffracted light.
 10. The surface position detecting apparatus according to claim 8, wherein the light source is configured to emit said light at a plurality of wavelengths, and wherein the positive diffraction orders and the negative diffraction orders for said plurality of wavelengths pass through the spatial filter.
 11. The surface position detecting apparatus according to claim 1, wherein the pattern is projected to a measurement area on the surface of the workpiece.
 12. The surface position detecting apparatus according to claim 11, wherein the pattern includes a spatially-periodic pattern having a period along a longitudinal direction of the measurement area.
 13. An exposure apparatus configured to expose said workpiece to light, the apparatus comprising: the surface position detecting apparatus according to claim 1; and a stage configured to hold said workpiece.
 14. The exposure apparatus of claim 13, wherein the stage changes a posture and/or position of the workpiece, the exposure apparatus further comprising a controller configured to control the posture and/or position of the workpiece by using an output from the surface position detecting apparatus.
 15. A manufacturing method comprising: exposing the workpiece to light by using the exposure apparatus of claim 13 to form an exposed workpiece carrying a predetermined pattern on the surface of the workpiece; developing the exposed workpiece; forming a mask layer dimensioned to have a form corresponding to the predetermined pattern on the surface of the workpiece; and processing the surface of the workpiece via the mask layer.
 16. A surface position determining apparatus having an optical axis and operable to produce an output representing a height profile of a surface being imaged, the apparatus comprising: a pattern-generating portion configured to form a light pattern on the surface in incident polychromatic light, wherein said light pattern includes alternating linearly-extended areas of high and low light irradiance; a pattern-projecting portion configured to image said light pattern onto an image plane defined by an optical detection system; and an optical filter including at least one of a dynamic filter and an interference filter disposed across the optical axis to modify at least one of a spectrum, state of polarization, phase, and irradiance of light delivered to said image plane such as to compensate an error contributed to said output by a Goos-Hanchen shift experienced by the incident polychromatic light at said surface, wherein the error is defined as an error averaged over wavelengths and polarizations of light incident onto said optical detector.
 17. An apparatus according to claim 16, wherein said pattern-projecting portion includes an optical element structure to change a curvature of a wavefront of light relayed by said optical element towards the image plane, and wherein said image plane is tilted with respect to an axis defined by said optical element.
 18. An apparatus according to claim 16, further including an autofocus (AF) system, wherein the optical detection system is structured to detect said light pattern, projected onto the imaging plane, simultaneously at different wavelengths and polarizations.
 19. An apparatus according to claim 16, further including an autofocus (AF) system, wherein said optical filter is structured to introduce a phase shift between beams of light delivered by the pattern-generating portion to the substrate to form said light pattern, said phase shift being wavelength-dependent and polarization-dependent.
 20. An apparatus according to claim 16, wherein the pattern-generating portion is structured to transmit only +1 and −1 orders of diffraction of said incident polychromatic light towards the substrate, and wherein said light pattern includes linear interference fringes formed by light beams that respectively correspond to said +1 and −1 orders of diffraction.
 21. An apparatus according to claim 20, wherein said optical filter is further structured to introduce a dynamic differential phase shift between light beams corresponding to said +1 and −1 orders of diffraction to introduce a spatial offset in said output, and wherein the spatial bias is dependent on at least one of a wavelength and a state of polarization of light incident onto the image plane.
 22. An apparatus according to claim 16, including an autofocus (AF) system and further comprising a reference reflector operably associated with the surface being measured, said reference reflector configured to deliver at least a portion of said incident polychromatic light to the image surface, and a stage configured to hold a workpiece having said surface. 